Comparing compact codebooks for visual categorization

نویسندگان

Jan C. van Gemert

Cees Snoek

Cor J. Veenman

Arnold W. M. Smeulders

Jan-Mark Geusebroek

چکیده

In the face of current large-scale video libraries, the practical applicability of content-based indexing algorithms is constrained by their efficiency. This paper strives for efficient large-scale video indexing by comparing various visual-based concept categorization techniques. In visual categorization, the popular codebook model has shown excellent categorization performance. The codebook model represents continuous visual features by discrete prototypes predefined in a vocabulary. The vocabulary size has a major impact on categorization efficiency, where a more compact vocabulary is more efficient. However, smaller vocabularies typically score lower on classification performance than larger vocabularies. This paper compares four approaches to achieve a compact codebook vocabulary while retaining categorization performance. For these four methods, we investigate the trade-off between codebook compactness and categorization performance. We evaluate the methods on more than 200 hours of challenging video data with as many as 101 semantic concepts. The results allow us to create a taxonomy of the four methods based on their efficiency and categorization performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Création de Vocabulaires Visuels Efficaces pour la Catégorisation d’Images. Creating Efficient Visual Codebooks for Image Categorization

We propose in this article an automatic method for building visual codebooks. Codebooks are obtained by quantizing local image descriptors and are used to automatically build discriminative representations of objects occuring in images. We describe an image categorization application based on the proposed approaches, providing results far above related state of the art existing methods.

متن کامل

Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines

Recently, the coding of local features (e.g. SIFT) for image categorization tasks has been extensively studied. Incorporated within the Bag of Words (BoW) framework, these techniques optimize the projection of local features into the visual codebook, leading to state-of-theart performances in many benchmark datasets. In this work, we propose a novel visual codebook learning approach using the r...

متن کامل

Speeded-up and Compact Visual Codebook for Object Recognition

The well known framework in the object recognition literature uses local information extracted at several patches in images which are then clustered by a suitable clustering technique. A visual codebook maps the patch-based descriptors into a fixed-length vector in histogram space to which standard classifiers can be directly applied. Thus, the construction of a codebook is an important step wh...

متن کامل

Kernel Codebooks for Scene Categorization

This paper introduces a method for scene categorization by modeling ambiguity in the popular codebook approach. The codebook approach describes an image as a bag of discrete visual codewords, where the frequency distributions of these words are used for image categorization. There are two drawbacks to the traditional codebook model: codeword uncertainty and codeword plausibility. Both of these ...

متن کامل

Analysis of Visual Impacts in Compact City’s Form

Desired physical form of cities has been noticeable since the beginning of urbanization, from old patterns of early civilizations to the latest urbanism’s theories, which offered to build better cities. The opinions in recent decades have expressed that compact physical form of cities is a better form than sprawl form to achieve urban sustainability. The form of the city is the embodiment of it...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Computer Vision and Image Understanding

دوره 114 شماره

صفحات -

تاریخ انتشار 2010

Comparing compact codebooks for visual categorization

نویسندگان

چکیده

منابع مشابه

Création de Vocabulaires Visuels Efficaces pour la Catégorisation d’Images. Creating Efficient Visual Codebooks for Image Categorization

Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines

Speeded-up and Compact Visual Codebook for Object Recognition

Kernel Codebooks for Scene Categorization

Analysis of Visual Impacts in Compact City’s Form

عنوان ژورنال:

اشتراک گذاری